Novel Kernel-Based Recognizers of Human Actions
Abstract
We study unsupervised and supervised recognition of human actions in video sequences. The videos are represented by probability distributions and then compared in a probabilistic framework. We introduce two novel approaches that outperform state-of-the-art algorithms when tested on the KTH and Weizmann public datasets: an unsupervised nonparametric kernel-based method exploiting the Maximum Mean Discrepancy test statistic, and a supervised method based on a Support Vector Machine with a characteristic kernel specifically tailored to histogram-based information.
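As a rough illustration of the Maximum Mean Discrepancy statistic mentioned in the abstract, the sketch below computes a biased estimate of squared MMD between two sets of feature vectors. The Gaussian RBF kernel, the bandwidth `sigma`, the synthetic data, and the function names `rbf_kernel` and `mmd2_biased` are all assumptions made for illustration; the abstract does not say which kernel or features the authors use.

```python
import numpy as np


def rbf_kernel(a, b, sigma=1.0):
    """Gaussian RBF kernel matrix between rows of a and rows of b (assumed kernel)."""
    sq_dists = (
        np.sum(a**2, axis=1)[:, None]
        + np.sum(b**2, axis=1)[None, :]
        - 2.0 * a @ b.T
    )
    # Clip tiny negative values caused by floating-point round-off.
    return np.exp(-np.maximum(sq_dists, 0.0) / (2.0 * sigma**2))


def mmd2_biased(x, y, sigma=1.0):
    """Biased (V-statistic) estimate of squared MMD between samples x and y."""
    k_xx = rbf_kernel(x, x, sigma)
    k_yy = rbf_kernel(y, y, sigma)
    k_xy = rbf_kernel(x, y, sigma)
    return float(k_xx.mean() + k_yy.mean() - 2.0 * k_xy.mean())


# Synthetic stand-ins for per-video feature samples (hypothetical data).
rng = np.random.default_rng(0)
same_a = rng.normal(0.0, 1.0, size=(200, 5))
same_b = rng.normal(0.0, 1.0, size=(200, 5))
shifted = rng.normal(1.0, 1.0, size=(200, 5))

mmd_same = mmd2_biased(same_a, same_b)   # two samples from one distribution
mmd_diff = mmd2_biased(same_a, shifted)  # samples from different distributions
print(f"same distribution: {mmd_same:.4f}, different: {mmd_diff:.4f}")
```

A small value suggests the two samples come from similar distributions, and a larger value suggests they differ. In an unsupervised setting such pairwise values could serve as a dissimilarity between videos, although how the authors use the statistic is not stated in the abstract.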
Similar resources
Correction: novel Kernel-based recognizers of human actions
The approach is taken further by Schindler and Van Gool [24], who investigated the detection of actions from very short sequences called snippets. Two separate pathways for motion and shape are considered. Motion is modeled by means of optical flow, computed for different directions and scales. Shape is represented by Gabor filter responses. MAX-pooling and comparison with a set of templates (l...
Using a Novel Concept of Potential Pixel Energy for Object Tracking
In this paper, we propose a new method for kernel-based object tracking that tracks the complete non-rigid object. Defining the union image blob and mapping it to a new representation, which we call the potential pixels matrix, are the main parts of the tracking algorithm. The union image blob is constructed by expanding the previous object region based on the histogram feature. The pote...
Phonotactic language recognition based on time-gap-weighted lattice kernels
The phonotactic method for spoken language recognition (SLR) deals with permissible phone patterns and their frequencies of occurrence in a specific language. The phone recognizers followed by vector space models (PR-VSM) system is a state-of-the-art phonotactic language identification system, in which any utterance can be mapped into a supervector filled with likelihood scores of the n-gram tokens (ba...
Generalizing Manipulations using Vision Kernels
In order to perform complex manipulation tasks, a robot must know which actions it can perform with the available objects. In unstructured environments, potential manipulations afforded by objects will not be pre-specified, and must instead be learned. Rather than determining each novel object’s affordances from scratch, the robot can learn more efficiently by generalizing manipulations from si...
Recognizers: Arguments and Design Decisions
The Forth text interpreter processes words and numbers. Currently the set of words can be extended by programmers, but the set of recognized numbers cannot. User-defined recognizers make it possible to extend the number-recognizer part, too. This paper shows the benefits of recognizers and discusses counterarguments. It also discusses several design decisions: Whether to define temporary words, or a set of interpre...
Journal: EURASIP J. Adv. Sig. Proc.
Volume: 2010
Publication date: 2010